THE FOREST ASSOCIATED WITH THE RECORD PROCESS ON A 

LEVY TREE 



ROMAIN ABRAHAM AND JEAN-FRANQOIS DELMAS 

f- ^ Abstract. We perform a pruning procedure on a Levy tree and instead of throwing away 

' the removed sub-tree, we regraft it on a given branch (not related to the Levy tree). We prove 

that the tree constructed by regrafting is distributed as the original Levy tree, generalizing 
a result of Addario-Berry, Broutin and Holmgren where only Aldous's tree is considered. 

QAs a consequence, we obtain that the "average pruning time" of a leaf is distributed as the 
height of a leaf picked at random in the Levy tree. 

t> 

p? 

Levy trees arise as the scaling limits of Galton- Watson trees in the same way as continuous 
. state branching processes (CSBPs) are the scaling limits of Galton- Watson processes (see [T7] . 

Chapter 2). Hence, Levy trees can be seen as the genealogical trees of some CSBPs, [23J. 
One can define a random variable T in the space of real trees (see |2U[ \T§\ [TB] ) that describes 
the genealogy of a CSBP with branching mechanism tjj of the form: 

<N ! „,./^ _ x , o\2 , f f-Xr 



1. Introduction 



> 1 4>(\) =aX + flX 1 + / e" Ar -l + Ar ) n{dr) for A > 0, 

[C ! J(0,+oo) v ' 

with a > 0, (3 > and ir a cr-finite measure on (0, +oo) such that f, n Jr Ar 2 )ir(dr) < +oo. 
£S) \ We assume that either f3 > or vr((0, 1)) = +oo. In particular, the corresponding CSBP is 

^ ' sub-critical as ^'(0) = a > . In order to use the setting of measured real trees developed 

in [1], we shall restrict ourselves to compact Levy trees, that is with branching mechanism 

satisfying the Grey condition: 

+°° dv 

~TT\ < + °°- 

^ \ This condition is equivalent to the compactness of the Levy tree, and to the a.s. extinction 

in finite time of the corresponding CSBP. 

In [6], a pruning mechanism has been constructed so that the Levy tree with branching 
mechanism ip pruned at rate q > is a Levy tree with branching mechanism ijj q defined by: 

ip q (X) = ip{\ + q) — ip(q) forA>0. 

This pruning is performed by throwing marks on the tree in a Poissonian manner and by 
cutting the tree according to these marks, generalizing the fragmentation procedure of the 
Brownian tree introduced in [9]. This pruning procedure allowed to construct a tree- valued 
Markov process [2] (see also [10] for an analogous construction in a discrete setting) and to 
study the record process on Aldous's continuum random tree (CRT) [Tj which is related to 
the number of cuts needed to reduce a Galton- Watson tree. 
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This problem of cutting down a random tree arises first in [28] : consider a rooted discrete 
tree with n vertices, pick an edge uniformly at random and remove it together with the sub- 
tree attached to it and then iterate the procedure on the remaining tree until only the root 
is left. The question is "How many cuts are needed to isolate the root by this procedure"? 
Asymptotics in law for this quantity are given in [28] when the tree is a Cayley tree (see also 
|11] [12] in this case where the problem is generalized to the isolation of several leaves and not 
only the root) and in [25j for conditioned (critical with finite variance) Galton- Watson trees. 
A.s. convergence has also been obtained in the latter case for a slightly different quantity in 
[1] using a special pruning procedure that we describe now. 

Let T be a Levy tree with branching mechanism ip and mT(dx) its "mass measure" sup- 
ported by the leaves of T . We denote by Pf the distribution of the Levy tree corresponding to 
the CSBP with branching mechanism tp starting at r and by the corresponding excursion 
measure also called canonical measure (in particular, Pf can be seen as the distribution of a 
"forest" of Levy trees given by a Poisson point measure with intensity rN 1 ^). The branching 
points of the Levy tree are either binary or of infinite degree (see [18] , Theorem 4.6) and to 
each infinite degree branching point x, one can associate a size A x which measures in some 
sense the number of sub-trees attached to it (see © in Section [23]) . We then consider a 
measure on T defined by: 

zeB roo (T) 

where is the length measure on the skeleton of the tree, Br 00 (7") is the set of branching 
points of infinite degree and 8 X is the Dirac measure at point x. Aldous's CRT corresponds 
to the distribution of T under N^, with ^(A) = ^ A 2 , and conditionally on m^(7~) = 1. In 
this case Br oc (T) is empty and thus {dy) = £^(dy). 

Then we consider, conditionally given T, a Poisson point process M^~(d9,dy) of marks on 
the tree with intensity 

l [0 , +O o)(0)defi T (dy). 

Parameter y indicates the location of the mark whereas 9 represents the time at which it 
appears. For every x S T, we set 

9{x) the first time 9 at which a mark appears between x and the root. 

We consider the average of these first cutting times over the Levy tree: 

6 = J 9(x)m T (dx). 

It has been proven in pQ (Theorem 6.1 and Corollary 5.3 with ip{u) = v? /2) in the frame- 
work of the Aldous's CRT, that if we denote by X n the number of cuts needed to isolate the 
root in the sub-tree spanned by n leaves randomly chosen, then a.s. lim n _^ +00 X n /L n = 0, 
with L n ~ y/2n the total length of the sub-tree. Moreover, the law of in that case is a 
Rayleigh distribution (i.e. with density xe~ x //2 l{ a; >o}). The distribution of is also the 
law of the height of a leaf picked at random in Aldous's tree. This surprising relationship is 
explained by Addario-Berry, Broutin and Holmgren in [7j, Theorem 10. The authors consider 
a branch with length 0, and when a mark appears, the tree is cut and the sub-tree which 
does not contain the root is removed and grafted on this branch (the grafting position is 
described using some local time). Then the new tree obtained by this grafting procedure is 
again distributed as Aldous's tree. 
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The goal of this paper is to generalize this result to general Levy trees. We consider a 
Levy tree T under and we perform the pruning procedure described above. When a mark 
appears, we remove the sub-tree attached to this mark and keep the sub-tree containing the 
root. We denote by T q the resulting tree at time q i.e. the set of points of the initial tree T 
which have no marks between them and the root at time q: 

T q = {xeT;6{x)>q}. 

According to |6j Theorem 1.1, T q is a Levy tree with branching mechanism tp q . We consider 
Q q the average of the records shifted by q over the Levy tree T q - 

Q q = [ (9(x)-q)m T (dx). 

JTq 

Remark that a.s. T q C T and hence Q q < 0; 




Figure 1. Pruning of a Levy tree (left) and tree T R obtained by regrafting 
on a branch (right). The marks are numbered according to their order of 
appearance. 



We define an equivalence relation on the tree T: x ~ y if the function 9 remains constant 
on the path between x and y. We consider the equivalence classes (7" 1 , i G I R ) and denote for 
each i E I R by 9{ the common value of the function 9. In the pruning procedure described 
above, the tree T l corresponds to the sub-tree which is removed at time 9i and it is distributed 
according to N^ 9 * . Then we consider a branch B R of length rooted at some end point, say 
0. The sub-tree T % is grafted on B R at distance @g i from the root, see Figure [TJ Let T R 
denote this tree obtained by regrafting. Our main result, see Theorem 13.11 relies on Laplace 
transform computations and can be stated as follows. 

Theorem. Assume the Grey condition holds. Under N^, (B R ,T R ) is distributed as (B,T) 
where B is a branch from the root to a leaf chosen at random on T according to the mass 
measure m T . 

In particular, this theorem implies the following corollary. 

Corollary. Under N^[d7~], is distributed as the height H of a leaf of the Levy tree chosen 
at random according to the mass measure mJ . 
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A probabilistic interpretation of those results for the Brownian CRT is provided in [7] 
using a path transformation on Brownian bridge or in [11] using a fragmentation tree. We 
do not know if such an approach is valid in the present general framework. 

Using the Bismut decomposition of Levy trees, we recover and extend to general Levy trees 
Proposition 8.2 from [2] on the asymptotics of the masses of (T l ,i £ I R ). For i £ I R , set 
a 1 = m r (T i ). 

Corollary. Assume the Grey condition holds. N^-a.e., we have: 

L 1 ielR 



Similar results hold for the convergence of 



N*[oa { „<„ } ] 

Those results generalize Proposition 8.3 from [2]. 



Y,iei R ^Ijo-^e} to ©) see Corollary 



J:' 



Figure 2. Bismut decomposition of a Levy tree. 



The above theorem states that the point process with atoms (Qg^T 1 ), i 6 I R is distributed 
as the point process that appears in the Bismut decomposition of a Levy tree. This may 
seem quite surprising. Indeed, if Q{ < 6j, then T l is stochastically greater than T- 7 (as a 
tree distributed according to N^ q can be obtained from a tree distributed according N 1 ^' for 
q > q' by pruning). Consequently, the trees that are grafted on B R are in some sense smaller 
and smaller whereas the trees in the Bismut decomposition have the same law. However, the 
intensity of the grafting is not uniform in the first case (contrary to the Bismut decomposition) 
and depends on the size of the trees grafted before, which gives at the end the identity in 
distribution. 

In the present work, we ignore the marks that fall on the sub-trees once they have been re- 
moved. However, we could use them to iterate our construction on each sub-trees (T l ,i G I ) 
and so on, in order to generalize to general Levy trees the result obtained for Aldous's CRT 
by Bertoin and Miermont |12| . 

In view of the present work, we conjecture that similar results to [25J hold for infinite 
variance offspring distribution. Let X n denote the number of cuts needed to isolate the root 
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by pruning at edges a Galton- Watson tree conditioned to have n vertices. We also consider 
the pruning at vertices inspired by [3], which is the discrete analogue of the continuous 
pruning: pick an edge uniformly at random and remove the vertex from which the edge 
comes from together with the sub-tree attached to this vertex. Let X n be the number of 
cuts until the root is removed by this procedure for a Galton- Watson tree conditioned to 
have n vertices. According to [25], the number of cuts needed to remove the root for the 
pruning at vertices (that is X n ) or to isolate the root for the pruning at edges (that is X n ) 
are asymptotically equivalent for finite variance offspring distribution. However, we expect 
a different behavior in the infinite variance case. Consider a critical Galton- Watson tree 
with offspring distribution in the domain of attraction of a stable law of index 7 G (1,2]. 
According to [16J or [26J, the (contour process of the) Galton- Watson tree conditioned to 
have total progeny n, properly rescaled, converges in distribution to (the contour process of) 
a Levy tree under [ • |er = 1], with ip(X) = C0A 7 for some c$ > 0. 

Conjecture. Let L n denote the length of the rescaled Galton- Watson tree conditioned to have 
total progeny n. We conjecture that: 

7 ; — * 

L n n— )-+oo 

for some random variable Z distributed as the height of a leaf chosen at random according to 
the mass measure under N^[- \o~ = 1]. 

Set a = (7 — l)/7- Using Laplace transform (see Theorem 12. ip . we get that the height H 
of a leaf randomly chosen in the Levy tree is distributed under as o~ a Z, with Z and a 
independent and the distribution of Z is characterized for n G N by: 

Erz n 1= 1 r(a)r(n + i) 

c «/7 7 „ T(a(n + 1)) 

In the particular case of the Aldous CRT, 7 = 2 and cq = 1/2, we recover, using the 
duplication formula of the gamma function, that Z (and thus H under N^[- \a = 1]) has 
Rayleigh distribution. 

The paper is organized as follows. We collect results on Levy trees in Section [21 with the 
Bismut decomposition is Section 12.71 and the pruning procedure in Section 12.81 The main 
result is then precisely stated in Section [3] and proved in Section [H 

2. Levy trees and the forest obtained by pruning 

2.1. Notations. Let (E,d) be a metric Polish space. For x G E, 5 X denotes the Dirac 
measure at point x. For \x a Borel measure on E and / a non-negative measurable function, 
we set (/I, f) = f f(x)n(dx) = //(/). 

2.2. Real trees. We refer to [HUE! [29] for a general presentation of R-trees and to [19] or 
|22j for their applications in the field of random real trees. Informally, real trees are metric 
spaces without loops, locally isometric to the real line. More precisely, a metric space (T, d) 
is a real tree if the following properties are satisfied: 

(1) For every s,t G T, there is a unique isometric map f s j from [0,d(s,t)] to T such that 
/ S) t(0) = s and f s>t (d(s,t)) = t. 

(2) For every s, t G T, if q is a continuous injective map from [0, 1] to T such that q(0) = s 
and q{l) = t, then g([0, 1]) = f S)t ([0,d(s,t)}). 
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If s, t G T, we will denote by [s, i] the range of the isometric map f s>t described above. We 
will also write [s,i| for the set {s,t} \ {t}. 

We say that (T, d, 0) is a rooted real tree with root if (T, d) is a real tree and G T is a 
distinguished vertex. 

Let (T, c£, 0) be a rooted real tree. If x G T, the degree of x, n(x), is the number of connected 
components of T \ {x}. We shall consider the set of leaves Lf(T) = {x G T\{0}, n(x) = 1}, 
the set of branching points Br(T) = {x G T, 77,(2;) > 3} and the set of infinite branching 
points is Br oc (T) = {x G T, n(x) = 00}. The skeleton of T is the set of points in the tree 
that aren't leaves: Sk(T) = T\Lf(T). The trace of the Borel cr-field of T restricted to Sk(T) 
is generated by the sets [s, s'J; s,s' G Sk(T). Hence, one defines uniquely a cr-finite Borel 
measure t T on T, called length measure of T, such that: 

£ T (U(T)) = and ^([s, s'j) = d(s, s'). 

For every x G T, [0, x] is interpreted as the ancestral line of vertex x in the tree. We define 
a partial order on T by setting x =^ y (x is an ancestor of y) if x G [0, yj. If x, y G T, there 
exists a unique z £ T, called the Most Recent Common Ancestor (MRCA) of x and y, such 
that [0, x] n [0, yj = [0, zj. We write z = x A y. 

2.3. Measured rooted real trees. We call an w-tree a weighted rooted real tree, i.e. a 
quadruplet (T, d, 0, m) where (T, (i, 0) is a locally compact rooted real tree and m is a locally 
finite measure on T. Sometimes, we will write (T, c? r , T , m T ) for (T, d, 0,m) to stress the 
dependence in T, or simply T when there is no confusion. We denote by T the set of w-trees. 

In order to define a tractable distance on w-trees, we need an equivalence relation between 
two w-trees, i.e. we identify two w-trees (T, ci T , T , m T ) and (T", d r , T , m 71 ) if there exists 
an isometric function which maps T onto T", which sends T onto T and which transports 
measure m T on measure m T . We will denote by T the set of measure-preserving and root- 
preserving isometry classes of w-trees. One can define a topology on T such that T is a Polish 
space, see for example (211 [2U E]. 

Let T, T' G T be w-trees that belong to the same equivalence class. Let ip be a measure- 
preserving root-preserving isometry that maps T onto T' . A w-tree- valued function F of the 
form F(T, (xi,i G /)) where (xi,i G J) is a family of points of T is said to be T-compatible if 
F(T', (ip(xi),i G /)) belongs to the same equivalence class as F(T, (xi,i G /)). 

Let T G 1. For x G T, we set h(x) = d(0, x) the height of x and 

(1) H max (T) = sup /i(x) 

the height of the tree (possibly infinite). Remark that for two w-trees in the same equivalence 
class, the heights are the same, hence i/ max (T) is well-defined for T G T. 
For a > 0, we set: 

T{a) = {x G T, d(0, x) = a} and vr a (T) = {x G T, d(0, x) < a}, 

the restriction of the tree T at level a and the truncated tree T up to level a. We consider 
7r a (T) with the induced distance, the root and the mass measure m Ka( - T ^ which is the 
restriction of m T to Ti a (T), to get an w-tree. Let us remark that the map Tr a is T-compatible. 
We denote by (T l '°,i G I) the connected components of T \ ir a (T). Let 0j be the MRCA of 
all the points of T l, °. We consider the real tree T l = T r, ° U {0j} rooted at point 0j with mass 
measure m Tl defined as the restriction of m T to T\ We will consider the point measure on 
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d T (s,t) 



T x T: 

iei 

2.4. Grafting procedure. We will define in this section a procedure by which we add (graft) 
w-trees on an existing w-tree. More precisely, let T G T and let ((Tj,Xj),i G /) be a finite or 
countable family of elements of T x T. We define the real tree obtained by grafting the trees 
Ti on T at point Xi. We set T = T U (|J i6 j Tj\{0 Tl }) where the symbol U means that we 
choose for the sets (Tj)j S j representatives of isometry classes in T which are disjoint subsets 
of some common set and that we perform the disjoint union of all these sets. We set T = $ T . 
The set T is endowed with the following metric d T : if s,t € T, 

'd T (s,t) if s,teT, 

d T (s,Xi) + d T *(Q) T *,t) H SET, t€ Ii\{0 Ti }, 

d T *(s,t) if s,i G Tj\{0 Ti }, 

I d T (a* , x,- ) + ^ (0 T i , s) + (0 T » , i) if i + j and s G Tj \{0 T ^ } , t G r 4 \{0 Tl } . 

We define the mass measure on T by: 

m r = m T + ( 1 TA{0 T *} mT< + mTl ({^ 1 })^ 

where 5 2 is the Dirac mass at point x. We will use the following notation: 

(2) (f,d f ^ f ,m f )=T® l£l (T i ,x l ). 

It is clear that the metric space (T, d T , $ T ) is still a rooted complete real tree. Notice that it 
is not always true that T remains locally compact or that m T defines a locally finite measure 
on T. For instance if we consider the grafting {0} © n gN (T, 0) where T is a non-trivial tree 
(i.e. we graft the same tree an infinite number of times on a single point), then the resulting 
tree is not locally compact. 

It is easy to check that this grafting procedure is T-compatible. 

2.5. Excursion measure of Levy tree. Let ip be a critical or sub-critical branching mech- 
anism defined by: 

(3) V(A) = a\ + /3X 2 + [ (e~ Xr -1 + Xr) vr(dr) 

J(0,+oo) v 1 

with a > 0, P > and tt is a cr-finite measure on (0, +oo) such that f, +00 - ) (rAr 2 )Tr(dr) < +oo 
and (it, 1) = +oo if /3 = 0. We also assume the Grey condition: 

f + °° dX 

(4) J m <+ao - 

The Grey condition is equivalent to the a.s. finiteness of the extinction time of the CSBP. 
This assumption is used to ensure that the corresponding Levy tree is compact. Let v be the 
unique non-negative solution of the equation: 

, 30 dX 
Va > 0, / — ttt = a. 



Iv(a) -0(A) 

We gather here results from [18], Theorem 4.2, Theorem 4.3, Theorem 4.6, Theorem 4.7. 
Remarks of pages 575 and 578 of [18] state that the local time l a is a function of the tree 
(see the third property below) and hence can be defined on T. 



8 ROMAIN ABRAHAM AND JEAN-FRANQOIS DELMAS 

Using the coding of compact real trees by height functions, we can define a a- finite measure 
N^[c£7~] on T, or excursion measure of Levy tree, with the following properties. 

(i) Height. Recall Definition ([I]) of the height H mSbX (T) of a tree. For all a > 0, 

N^[# max (T) >a] = v(a). 

(ii) Mass measure. The mass measure m7~ is supported by Lf(T), N^[dT]-a.e. 

(iii) Local time. There exists a process (£ a ,a > 0) with values on finite measures on 
7", which is cadlag for the weak topology on finite measures on T and such that 
N^[dT]-a.e.: 

/■oo 

(5) m T (dx) = / £ a (dx)da, 

Jo 

£° = 0, inf{o > 0; T = 0} = sup{a > 0; £ a ^ 0} = H max (T) and for every fixed a > 0, 

• The measure £ a is supported on T{a). 

• We have for every bounded continuous function <j) on T: 

(r,0) = 4™^y / ^{H^tT^K^dT) 

= 40 ^) / ^Vm^TO^j^e^tfr'), if a > 0. 

Under N 1 ^, the real valued process ((£ a ,l),a > 0) is distributed as a CSBP with 
branching mechanism tp under its canonical measure. 

(iv) Branching property. For every a > 0, the conditional distribution of the point 
measure Mj{dx,dV) under N^[dT\H milx {T) > a], given 7r a (T), is that of a Poisson 
point measure on T(a) x T with intensity r(c£r)N^[cfH]. 

(v) Branching points. 

• N^[dT]-a.e., the branching points of T are of degree 3 or +oo. 

• The set of binary branching points (i.e. of degree 3) is empty a.e if /3 = 
and is a countable dense subset of T if /? > 0. 

• The set Br 00 (7") of infinite branching points is nonempty with N^-positive mea- 
sure if and only if ir ^ 0. If (ir, 1) = +oo, the set Br^T) is N^-a.e. a countable 
dense subset of T ■ 

(vi) Mass of the nodes. The set {d(0, x), x E Br^CT)} coincides N^-a.e. with the 
set of discontinuity times of the mapping a i— > £ a . Moreover, N^-a.e., for every such 
discontinuity time b, there is a unique Xb G Br^T) n T(b) and A& > 0, such that: 

£ b = £ b ~ + A b 5 Xb , 

where A;, > is called the mass of the node Xb- Furthermore A;, can be obtained by 
the approximation: 

(6) A b = lim — !— n(x fe , e), 

e->o v(e) 

where n(xb,e) = f li x=Xb \\iH max M-i\ >t \M^ {dx,dT') is the number of sub-trees orig- 
inating from Xb with height larger than e. 

In order to stress the dependence in 7~, we may write £ a ^ for £ a . 
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We set o" 7 " or simply a when there is no confusion, the total mass of the mass measure on 

T: 

(7) a = m r (T). 

In particular, as a is distributed as the total mass of a CSBP under its canonical measure, 
we have that N^-a.s. a > and for q > (see for instance [27] . Corollary 10.9 for the first 
equality, the others being obtained by differentiation): 

<(«) 



(8) 



1 



a e 



-ip{q)cr 



1 

WW) 



and 



a 2 e -^( 9 )a 



3 ' 



The last two equations hold for q = if ^'(0) > 0. 



2.6. Other measures on T. For each r > 0, we define a probability measure Pr on T as 
follows. Let r > and X/fceiC^T* be a P°i sson point measure on T with intensity rN^. 
Consider {0} as the trivial w-tree reduced to the root with null mass measure. Define 
T = {0} ©fce/c (T fc ,0). Using Property (i) as well as (jHJ), one easily gets that for every 
e > there is only a finite number of trees T k with height larger than e. As each tree 7~ fc is 
compact, we deduce that T is a compact w-tree, and hence belongs to T. We denote by Pr 
its distribution. Its corresponding local time is defined by t a = ^fcgAC anc ^ ^ s total mass 
is defined by a = YlkeK ^ ■ Under Ft, the real valued process ((£ a , l),a > 0) is distributed 
as a CSBP with branching mechanism ip with initial value r. 



We consider the following measure on T: 
(9) N^[dT\ = 2/3N^[dT] + 



V7T 



(dr)Ff(dT) 



which appears as the grafting intensity in the tree- valued Markov process of [4]. From 
and ([9]), elementary computations yield for q > 0: 



(10) 

as well as 
(11) 



N 



1 



-i/>(q)a 



a e 



-ip(q)a 



V(q) 



and 



1 



iP'(q) 



The last two equalities also hold for q = if i/j'(0) > 0. 



2.7. Bismut decomposition of a Levy tree. We first present a decomposition of T G % 
according to a given vertex x G T. We denote by (T J, °, j G J x ) the connected components 
of T\ [0,x]. For every j G J x , let be the MRCA of and consider T-? = T-?'° U {x.,} 
as an element of T with mass measure the mass measure of T restricted to T 3 '°. In order to 
graft together all the sub-trees with the same MRCA, we consider the following equivalence 
relation on J x : 



J ~ J <= 

Let I x be the set of equivalence classes. For 



l B 



for the common value of x, 



with j G We consider {x^} as an element of T with mass measure m ({x^})5 x ,.y 
[i] G I x , we consider the following element of T defined by: 



For 



T 



B,\i] 



{x {{[ } ® jm (T J ,x H ). 
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Let hw = d($,X[q). We consider the random point measure M% on M + x T defined by: 

mis 

Under N^, conditionally on T, let U be a T- valued random variable, with distribution 
a ~\ m T j n other words, conditionally on T, U represents a leaf chosen "uniformly" at 
random according to the mass measure m 7 ". We define under N 1 ^ a non-negative random 
variable and a random point measure on R + x T as follows: 

(12) 



H = d T ((b T , U) and Z 



B 



Mjj. 



Let us remark that the distribution of (H, Z B ) does not depend on the choice of the repre- 
sentative in the equivalence class and thus this random variable is well defined under N^. 

By construction, for every non-negative measurable function <3? on R+ x T and for every 
A > 0, p > 0, we have: 



ere 



-\a-pH-(Z B ,*) 



m (dx) e 



T 



-Xa-ph(x)-(M% 



As a direct consequence of Theorem 4.5 of [18], we get the following result. 

Theorem 2.1. For every non-negative measurable function on M + x T and for every A > 0, 
p > 0, we have: 



ere 



-\a-pH-(Z B ,*) 



da e 



-pa 



where 
(13) 



5 (A,n) =V'(0)+N^ 



cxp 



-Ao--$(u,r) 



g(X, u)du 



In other words, under [a, dT], if we choose a leaf J7 uniformly (i.e. according to the 
normalized mass measure m 7 "), the height H of this leaf is distributed according to the 
density da e~^' ^ a and, conditionally on H, the point measure Z B is a Poisson point process 
on [0,H] with intensity N^[dT\. 

2.8. Pruning a Levy tree. A general pruning of a Levy tree has been defined in [6]. We 
use a special case of this pruning depending on a one-dimensional parameter 6 used first in 
[30] to define a fragmentation process of the tree. 

More precisely, under N^[dT], we consider a mark process M^~(d6,dy) on the tree which 
is a Poisson point measure on M + x T with intensity: 



l [0t+oo) (9)M 2pi T (dy) + Yl ^WV) 

\ xGBroo(T) 

The atoms {0i,yi)iel °f this measure can be seen as marks that arrive on the tree, y% being 
the location of the mark and 6i the "time" at which it appears. There are two kinds of 
marks: some are "uniformly" distributed on the skeleton of the tree (they correspond to the 
term 2(31^ in the intensity) whereas the others are located on the infinite branching points 
of the tree, an infinite branching point y being first marked after an exponential time with 
parameter A y . 

For every x G T, we set: 

9{x) = inf{# > 0, Af T ([0, 9} x [0, xj) > 0}. 
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The process (6(x),x 6 T) is called the record process on the tree T as defined in pQ. This 
corresponds to the first time at which a mark arrives on [0, x\. Using this record process, we 
define the pruned tree at time q as: 

T q = {x G T, 6{x) > q} 

with the induced metric, root and mass measure the restriction of the mass measure mT. 
If one cuts the tree T at time 9i at point yi, then T q is the sub-tree of T containing the root 
at time q. Here again, the definition of T~ q is T-compatible. 

Proposition 2.2. fjg], Theorem 1.1) For q > fixed, the distribution ofT q under is N^ 9 
with the branching mechanism ip q defined for A > by: 

(14) %{\) = ^(A + q) - j,(q). 

Furthermore, the measure N^ 9 is absolutely continuous with respect to N 1 ^, see [2], Lemma 
6.2: for every q > and every non- negative measurable function F on T, we have 

(15) [F(T)} = \F(T) e-^ q)a 

We shall refer to this equation as the Girsanov transformation for Levy trees as it corresponds 
to the Girsanov transformation of the height process (which is Brownian) in the quadratic 
case 7r(dr) = 0. This transformation corresponds also to the Essher transformation for the 
underlying Levy process used in [17] to define the height process in the general case. We 
deduce from definition ([9]) of N^, that for any measurable non- negative functionals F and 
q > 0: 



(16) [F(T)\ = \f(T) e 



Making q vary allows us to define a tree- valued process (T q , q > 0) which is a Markov process 
under N^, see [2] Lemma 5.3 stated for the family of exploration processes which codes for 
the corresponding Levy trees. The process (T q ,q > 0) is a non- increasing process (for the 
inclusion of trees), and is cadlag. Its one-dimensional marginals are described in Proposition 
12.21 whereas its transition probabilities are given by the so-called special Markov property 
(see [6] Theorem 4.2 or [2] Theorem 5.6). The time-reversed process is also a Markov process 
and its infinitesimal transitions are described in [5] using a point process whose definition we 
recall now. We set: 

{e h iei R } 

the set of jumping times of the process (T$, 9 > 0). For every i 6 I , we set 7~*'° = 7^- \ T$ t 
and denote by x; the MRCA of T { '°. For i £ I R , we set: 

T = V'° u { Xi } 

which is a real tree with distance the induced distance, root Xi and mass measure the re- 
striction of m 7 " to T % . Finally, we define, conditionally on To, the following random point 
measure on To x T x 1R_|_: 

N = Yl S (x i ,T i ,6 i )- 
iei R 

Theorem 2.3 ([4], Theorem 3.2 and Lemma 3.3). Under ; the predictable compensator 
of the backward point process defined on R + by: 



l{d<q>}N{dx,dT, dq') 
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with respect to the backward left- continuous filtration T = (J-g,9 > 0) defined by: 
Tq = A), i G I R A >9) = a(T g ~, q > 9). 

is given by: 

n(dx,dT,dq) = m T ^{dx)n^{dT]l {q >^dq. 

And for any non-negative predictable process (j) with respect to the backward filtration T , 
we have: 



Af{dx, dT, dq)4>(q,Tq,T q -) 



fj,(dx, dT, dq)(f)(q,T g ,T q © (T, a;)) 



3. Statement of the main result 



We keep the notations of the previous section. First notice that for i G I R , 9{x) = 9i for 
every x E T % ■ We set o % = m T (T t ) = — ag t and a q = m (Tq) the total mass of T q - By 
construction, we have for every q > 0: 



0", 



•/ - Yl 1 {^><?} (ji 



We set: 



(0(x) — q) m T (dx). 



The quantity := ©o appears in [lj as the limit of the number of cuts on the Aldous' CRT 
to isolate the root. Since 9(x) is constant on T l , we get: 



i£l R 



o~ r dr. 



For simplicity, we write for Go and a for o~q. 

We consider the random point measure Z R on M + x T defined by: 



(17) 



i€l R 

Recall the definition of H and Z B of Subsection 12.71 

The main result of the paper is the next theorem that identifies the law of the pair (H, Z B ) 
and the pair (Q,Z). 

Theorem 3.1. Assume the Grey condition holds. For every non-negative measurable function 
$ on IR + x T, and every A > 0, p > 0, we have: 



a e 



-\o-pH-{Z B ,0) 



a e 



-\<r-f>e-(z R ,$) 



In particular O is distributed as the height H of a leaf chosen according to the normalized 
mass measure on the Levy tree. 

Recall that lim e _>.Q [cr > e] = +oo and lim £ _>.o N^[crl{ .< £ }.] = 0, as well as: 

lim^N^[al {(r < £} ] = +oo 

thanks to Lemma 4.1 from [J3] (which is stated for /3 = but which also holds for /3 > 0). 
The next corollary is a direct consequence of Theorem 13.11 and the properties of Poisson 
point measures for the Bismut decomposition (see Proposition 4.2 in p3] for a proof of 
similar results). 



THE FOREST ASSOCIATED WITH THE RECORD PROCESS ON A LEVY TREE 13 

Corollary 3.2. Assume the Grey condition holds. N^-a.e., we have: 

N^-a.e., /or any positive sequence (e n ,n > 0) converging to 0, i/iere exists a subsequence 



/ r - T ^ 0"*1 r^-i^p 1=0. 



When ^ is regularly varying at infinity with index 7 G (1, 2], the previous convergence holds 

4. Proof of the main result 

4.1. Preliminaries results. We first state a basic lemma. 

Lemma 4.1. Let Mi = Y^jeJi ^ t ^ x j ^ e a P ^ measure on [0, +00). IfY^j^Jx x j < then 
for every r > 0, we /iai>e: 

(18) 1 - exp - ^ 1 {r ] >r}Xj = E l{ rj >r}(l ~ e~^) ex P ~ E 1 {r t >r ] } x l 

\ jeJi J jeJi \ eeJi 

Proof. The result is obvious for J\ finite. For the infinite case, for e > consider the finite 
set: 

Jl,s = {j G Jl, xj > e}. 
Apply Formula (I18p with Ji replaced by Ji )£ and then conclude by letting e tend to thanks 
to monotone convergence and dominated convergence. □ 

Since T q is distributed according to N^ 9 , we deduce from (jHJ) together with (fT6|) that for 
9 > 0: 

(19) N*[aJ = N*[a] = N<V 2 ] = N^[a 2 ] = 

4.2. Laplace transform of (ex, 0,JJ R ). 

Proposition 4.2. Let & be a non-negative measurable function on x T. Assume that 
(Z R ,$) < +00 N*-a.e. and for all A > 0, sup n >Q o(A, u) < +00 u<i£/i g defined by fl3\) . 
Then, for all A > and p > 0, we have: 

(20) \a(p + g{\, 6)) e -^-pB-{z^)l = 1. 

Proof. For every e > 0, q > 0, we set: 

°t = E 1 {^>9} 1 {^>£}°' i ' ®9 = E 1 {^>g} 1 {(T»>£} " J (^ i - Q), 
iei R iei R 

and 

^ = E 1 {fli> 5 } 1 {^> e }*( e ffi.^ < ). z ? = E i^i^e^T), 

so that Zo = (Z R , <&). For every e > 0, q > 0, we set: 

^(A, p) = [l — exp(-A^ - - Zf )] . 
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Since (2 R , $) is finite by assumption, we get that Z £ is finite. We use Lemma B~T1 to get: 



^(A,p) = ^ V> e }(l " exp(-(A + p{9i - q))a l - $(& 0i ,V))) 

iEl R ^ ' 

exp [-J2 1 {e e >e l } 1 {*z>e}((^ + p(Pt~ q))°* + H@e v T e 

Then, if we use Theorem 12.31 (recall that a q = m' q (Tq)), we get: 

c^(A,p)=N^ dra r G £ (\ + p(r-q),Q r ) exp(- (A + p(r - q))a £ - p& £ - 



with 

G*(/e,t) =N^ 
Thanks to (fTTj) and (fT6l) . we get: 

(21) 



1 {CT > £} ( 1 - e — 



< G £ (k, t) < N* [1 {CT > £} ] < 1 [a] = \ ffi- 



Since V 1 " is non-increasing and ^' is non-decreasing, we get that for fixed g > 0, the map 
r i->- <9 r ( ^7fT^ ) is non-negative and bounded for r > q. We deduce from (fTT|) and (fT6|) that: 



l { .> £} ^e— *C«.T) 



We deduce that the map k \— > G £ (n,t) is C 1 and: 
(22) 0<8 K G £ {K,t) = N^ 

We set: 



< i_j_ d f-nr) 



e ip'(r) \ ip'(r) 

H £ tX (q)=^ a r G £ (X + p(r-q),Q r ) expf-(A + p(r - q))<r £ - p@ £ - Z, 



so that: 



+oo 



<p\{\,p) = / K x (q)dr. 



Thanks to flU} and (HHP, we get < H £ x (q) < e"V '(r) /ip' '(r) 2 . This implies in turn that 
^ q (X,p)<e- 1 /^(q). 
For r > 0, k > 0, we set: 

= [a r (8 K G £ (k, Q r ) + a £ G £ {K, O r )) e -«4-i*%-zf] . 
Since a £ < a r , we have, using (fTUP : 



0< J£(k) < ^N^ 



1 

< - 

e 



1 / -£(r) \ 2 <(r) 



ip ! (r) \ ip'(r) J r ip'(r) 
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By monotonicity, we get: 

/ dudsl {u<s} h £ s (X+- p(s - u)) 

J\q,+oo) 2 

duds 1 



< 



< 



[<j,+oo) 2 



{u<s} 



1 



duds - 1 



[<7,+oo) 2 



du 



[q,+oo) 



e 

i)'{uf 



{u<s} 



ij'(s) 2 
1 

ip'(u)' 



d 8 



i)'{ s ) ) v'(s) 4 

ifj'(s) J ip'(u) 2 ij'(s) 2 



1 1 



e ip'(q) 2 

We deduce that the maps u \-t H £ s x {u) and A \-+ H e x (u) are C 1 for A > 0, s > u > q, with: 
d u Hl x (u) = -pd x Hl x {u) and \d x Hl )X {u)\ < h £ s {X + p(s - «)). 

Thus we have Jj ? +QO )2 duds 1{«< S } d u H e s x {u) < p/etp'(q) 2 . Then, elementary computation 
yields: 



m x {q)dr 



r+oo 


r 


I du 


KM ~ / 




J u 



dsd u H e x (u 



, we deduce that: 



We deduce that the maps q h-> <p q (X, p) and A \-+ ip q (X, p) are C 1 and: 

r+oo r+oo 
d q <p E q (\, p) = -Hl x (q) + / ds d u HI }X (q) = -H^q) - pd x / ds H^q). 

J q J a 

With H s q>x (q) = [a q G q (X, Q q ) exp(-A^ - P Q q - Z\ 

(23) d q <p £ q (X, p) = —N^ [a q G q (X, @ q ) exp(-A^ - p% - Z £ q 
We also have: 

(24) a A ^(A, p) = N* [al exp(-A^ - P Q g - Z £ q )] . 
Moreover, thanks to Girsanov formula (|15p . we have: 

^(A, p) = [(1 - exp(-Aag - p% - Z^)) e"^" . 

We deduce that: 

d q ip q (X, p) = -^(q)N^ [a (1 - exp(-Aag - P G £ - ZD) e~^~ 
= -1 + r//(g)N^ [a, exp(-A^ - pQ q - Z\)\ . 



We deduce from OSD and (El that: 



(25) 



i. 



Using Girsanov formula (|16|) and (|1Q|) . we get: 

G-(A, t) < G°(A, t) = g(X + ^(q),t) - ^'(0) - nHi - e"^H = 5 (A + ^(q), t) - ^(q). 
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We deduce that: 



<T g &'(q) + G e JX, e q )) + pa £ q < <J q {swpg{\ + ^(q),t) + p). 



By dominated convergence, letting e decrease to in (|25|). we deduce that: 



<M #(A + 4>{q), ©) + P) exp (-Xa- q - pQ q - Z q 



1. 



Using Girsanov formula (|15p once again, we get: 



o(9& + V>(</), ©) + p) exp (-(A + rP(q))a - pB - (Z R , $)) 



Since A > and g > are arbitrary, we deduce that (|20|) holds. 



1. 



□ 



We deduce the following corollary which states that the pair H and the projection of Z B 
on T have the same distribution as and the projection of Z R on T. 

Let 7 be a non-negative measurable function defined on T. For a measure Z on M + x T, 
we shall abuse notation and write: 

<Z, 7 > = J j(T)Z(dt,dT). 

Corollary 4.3. For every non-negative measurable function 7 on T such that 7(7") = if 
wT{T) = 0, and every A > ; p > 0, we have: 



(26) 



ae 



-\a-pH-(Z B ,7) 



cr e 



-A --pO-(2: ii ,7> 



Proof. Let A > 0. Recall tr = m^(T). First assume that 7(7") < cu for some finite constant 
c. Taking <3?(i,7~) = 7(T) in Theorem 12.11 and using that g(X,u) doesn't depend on u, we 
get: 

1 



a e 



-\<j-pH-{Z B ,7) 



P + S(A,0) 

Notice that (Z R ,Q) < ca and thus hypotheses from Proposition 14.21 are in force. We deduce 
from Proposition 14.21 that: 

f# [a exp (-Aa - p@ - (Z R ,j))] = ) • 

p + #(A,0) 

Thus equality (|26p holds. Use monotone convergence to remove hypotheses A > and 
7(7") < co" for some finite constant c. □ 

4.3. Proof of Theorem 13.11 Let $ be a measurable non-negative function defined on the 
space R + x T. Let us assume that for every T € T, t \-t $(£, T) is continuous, (Z R ,Q) is 
finite N^-a.s. and that the function g defined by (fT3|) is bounded for any A > as a function 
of u. We set: 



T R (r, h) = 



-{z R &) 



a 



r, e = h 



We deduce from Proposition 14.21 and Corollary 14.31 that for every A > 0, p > 0, we have: 

1 = rf> [a(p + g(X, G)) e- x ^ e ^ zR ^' 
= [<r(p + g(X, &)) e - Xa -P & T R (a, 9) 
(27) = [a (p + g(X, H)) e - x<7 - pH T R {a, H) . 
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Let fi(hi,%) b e a Poisson measure with intensity dh N^[d7~] under some probability 

measure P. For every i E I, we set <7j = m 7 *(7i). Then for every h > 0, we set: 

ieJ 

Equation (|27|) and Theorem 12.11 imply that: 



dh e~^'^ h e- G ^(p + g(X,h)) = l, 



with: 

We deduce that: 
r+oo 

/ dh e 
J o 

with: 



log I E 



-ph 



-tp' (O)h-G(h) 



Mh) r R (a(h),h) 
ph dA(h) 



-oo 2 

- e 



P 



dh e- ph A(h), 



A(h)= du e-^'^ u - G ^ g(\,u). 
Jo 

Since this holds for every p > 0, uniqueness of the Laplace transform implies that: 
(28) A ( h ) = 1 - e -i>'{0)h-G(h) ae 

Since A is continuous, there exists a continuous function G such that a.e. G = G. Since, 
t i—t- <l?(i, T) is continuous, we get that, for every A > 0, u \— > g(X,u) is continuous. Then A 
is of class C 1 and so is G. Moreover, by differentiating ([28]) . we get: 

^(0) + G'(h)=g(X,h). 

Since A(0) = 0, we get (5(0) = 0, and thus ij/(0)h + G(h) = $ g(X,u)du. This implies that: 

r-h 



(29) 

We have: 



g(X, u)du = G(h) + if>'(p)h a.e. 



ae 



-\o-pH-(Z B ,4) 



cZ/t e~ ph ~fo 9{Ku)du 
dh e -(p+^(o))h-G(h) 

dh e~^'^ h E \e- x ^ T R (a(h),h) 

N V> ae - XcT - pH F R (a,H) 
ae -\*- P e r R ( a ,Q) 



a e 



-\a-p0-{Z R ,0) 



where we used Theorem 12.11 for the first and fourth equalities, (|29p for the second, the 
definition of G for the third, Corollary 14.31 (which states that (a, H) and (a, 0) have the same 
distribution under N^) for the fifth, and the definition of F R for the last. 

As [a e~ Xa ] is finite, we can remove using dominated convergence the hypothesis 
(Z R ,$) finite. The function g defined by flED, with $(i,T) replaced by ®(t,T)l {rT < 1/n} , 
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is bounded for any A > as a function of u. Thus, using again dominated convergence, we 
can remove the hypothesis on such that function g defined by (|13p is bounded for any A > 
as a function of u. Then use monotone class theorem to remove the continuity hypothesis on 
$ and end the proof. 
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